MD-SQL: A Language for Meta-Data Queries over Relational Databases
Authors
Abstract
Future users of large, interconnected databases and data warehouses will increasingly require schematic transparency of data manipulation systems, in that (i) data from heterogeneous sources must be compared and interrelated and (ii) data must be queried and extracted by distant users having minimal knowledge of its logical structure. A query language that abstracts over meta-data as well as ordinary data is needed. Previous work in this area has resulted in HILOG [1], SchemaLog [7] and SchemaSQL [8]. Although SchemaSQL improves on its predecessors, it remains somewhat informal and relies on a specialized transformation into a fragment of the tabular algebra [10] to give it a viable operational semantics. In contrast, we provide a complete EBNF for Meta-Data SQL (MD-SQL) as a straightforward extension of a relationally complete subset of standard SQL. Like SchemaSQL, MD-SQL allows queries involving meta-data and ordinary data in a multi-database context over potentially disparate platforms. Schematic elements and data are freely interchangeable, and queries are allowed whose output type cannot be known at compile time. Unlike SchemaSQL, however, each MD-SQL query translates into a series of simple, atomic operations, each of which is inherently relational. We formalize this translation by presenting a complete meta-algebra, which is shown to be equivalent to MD-SQL. Furthermore, we provide some complexity results, in particular that MD-SQL and the meta-algebra yield characterizations of PSPACE. We also give results concerning when the output type of an MD-SQL query can be deduced at compile time. Finally, we briefly discuss an implementation of MD-SQL over an ordinary relational system that uses the DynamicSQL/CLI standard. Since MD-SQL is relational in nature, our implementation can benefit directly from existing query optimization techniques.
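The idea of a query that ranges over meta-data as well as data, evaluated through dynamically built SQL, can be illustrated with a minimal sketch. This is not MD-SQL syntax and not the authors' implementation: it assumes Python's standard sqlite3 module as a stand-in for a relational system with a dynamic SQL interface, and the table names (east, west, staff) and helper functions are hypothetical. The second helper builds its query from the catalog, so the set of tables it reads is not known until run time.

```python
import sqlite3


def tables_with_column(conn, column):
    """Return the names of tables that have a column called `column`."""
    # Meta-data step: read table names from SQLite's catalog.
    names = [row[0] for row in conn.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name")]
    found = []
    for name in names:
        # PRAGMA table_info yields one row per column; index 1 is its name.
        cols = [row[1] for row in conn.execute(f'PRAGMA table_info("{name}")')]
        if column in cols:
            found.append(name)
    return found


def union_over_tables(conn, column):
    """Build and run one SQL query whose text depends on the catalog."""
    tables = tables_with_column(conn, column)
    if not tables:
        return []
    # Each table name becomes an ordinary data value in the result.
    parts = [f'SELECT ? AS source, "{column}" AS value FROM "{t}"'
             for t in tables]
    sql = " UNION ALL ".join(parts) + " ORDER BY source, value"
    return conn.execute(sql, tables).fetchall()


def demo():
    # Hypothetical example schema: two regional price tables and one
    # unrelated table without a price column.
    conn = sqlite3.connect(":memory:")
    conn.executescript("""
        CREATE TABLE east (item TEXT, price INTEGER);
        CREATE TABLE west (item TEXT, price INTEGER);
        CREATE TABLE staff (name TEXT);
        INSERT INTO east VALUES ('bolt', 3);
        INSERT INTO west VALUES ('bolt', 4), ('nut', 1);
        INSERT INTO staff VALUES ('ann');
    """)
    return union_over_tables(conn, "price")


if __name__ == "__main__":
    print(demo())
```

In this sketch the table name moves from schema to data (the source column), which is the kind of interchange between schematic elements and data that the abstract describes; every step remains an ordinary relational query.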
Similar resources
A Language for Meta-Data Queries over Relational Databases
Future users of large, interconnected databases and data warehouses will increasingly require schematic transparency of data manipulation systems, in that (i) data from heterogeneous sources must be compared and interrelated and (ii) data must be queried and extracted by distant users having minimal knowledge of its logical structure. A query language that abstracts over meta-data as well as or...
Augmenting SQL with Dynamic Restructuring to Support Interoperability in a Relational Federation
In this work, we consider augmenting SQL with constructs allowing dynamic restructuring. Dynamic restructuring occurs naturally in a framework including second-order capabilities for integrating metadata and data within single queries. The context and motivating example for this extension is a federation of relational databases, containing semantically similar information in schematically dispa...
Processing Strategy for Global XQuery Queries Based on XQuery Join Cost
XML is a standard for exchanging and formatting data over the Internet and XQuery is a standard query language for searching and integrating XML data. Therefore, it is a natural choice for interoperability to use XQuery over the Internet. Global XQuery queries search and integrate heterogeneous data, being distributed in the local systems. In order to process efficiently global XQuery queries, ...
Standardizing the Querying Process with SGML The SQL DTD
One of the most exciting applications of SGML to emerge in recent years is its use in document databases. The structural information embedded in SGML documents makes it possible to query SGML documents and extract information in an automatic manner; however, this querying process has not been standardized. As a result, different SGML database implementations use their own query lang...
RDF Modelling and SPARQL Processing of SQL Abstract Syntax Trees
Corentin Follenfant, Olivier Corby, Fabien Gandon, and David Trastour (INRIA Sophia Antipolis Méditerranée; SAP Research). Most enterprise systems rely on relational databases, and therefore SQL queries, to populate dynamic documents such as business intelligence reports, dashboards or spreadsheets. These queries repr...